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Abstract —This paper studies online power control policies 
for outage minimization in a fading wireless link with energy 
harvesting transmitter and receiver. The outage occurs when 
either the transmitter or the receiver does not have enough 
energy, or the channel is in outage, where the transmitter only 
has the channel distribution information. Under infinite battery 
capacity and without retransmission, we prove that threshold- 
based power control policies are optimal. We thus propose 
disjoint/joint threshold-based policies with and without battery 
state sharing between the transmitter and receiver, respectively. 
We also analyze the Impact of practical receiver detection and 
processing on the outage performance. When retransmission is 
considered, policy with linear power levels is adopted to adapt the 
power thresholds per retransmission. With finite battery capacity, 
a three dimensional finite state Markov chain is formulated to 
calculate the optimal parameters and corresponding performance 
of proposed policies. The energy arrival correlation between 
the transmitter and receiver is addressed for both finite and 
infinite battery cases. Numerical results show the impact of 
battery capacity, energy arrival correlation and detection cost 
on the outage performance of the proposed policies, as well 
as the tradeoff between the outage probability and the average 
transmission times. 

Index Terms —Energy harvesting, outage minimization, circuit 
power, power control, finite state Markov chain. 


I. Introduction 

Wireless transceivers powered by renewable energy are be¬ 
coming more and more appealing due to their ease of deploy¬ 
ment and environment friendliness IT], of which the feasibility 
is confirmed by measurements |3. When the transceivers in 
wireless links are powered by energy harvesting (EH), due to 
the randomness of energy arrivals, the transmission can fail 
when the energy stored in the battery is insufficient, and thus 
transmission reliability and throughput are degraded. This calls 
for new ways of power control policies adapting themselves to 
not only the channel fading, but also random energy arrivals 
at both the transmitter and the receiver. 
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There have been some recent studies about EH-based 
wireless transmissions, specifically dealing with the multi¬ 
fold randomness from energy arrival, data arrival and channel 
fading, and their main focuses are on EH transmitters. With 
infinite battery capacity, optimal power control policies are 
studied in 0 and a to stabilize the system under random 
packet arrivals. Ref. a proves that the threshold-based power 
control is optimal for single link outage minimization over 
finite time horizon, and is asymptotic optimal for infinite 
time horizon. Ref. |6l targets at minimizing the link outage 
probability with interference from other transmitters, where 
the author uses random walk theory to find the transmission 
probability of the EH transmitter. With finite battery capacity, 
optimal offline water-filling algorithms are developed in iTl- 
0, where the optimal directional water-filling is found based 
on the causality constraint of using the harvested energy, and 
suboptimal online policies are also investigated. Our previous 
work m extends their work to the case with the hybrid 
transmitter powered by EH and power grid. Ref. HD identifies 
the optimal power control policy and corresponding perfor¬ 
mance limit with finite data and energy storages under concave 
utility functions. Besides the transmission power, some recent 
studies m-M deal with the circuit power consumption 
at the transmitter, and ref. ca addresses the practical half¬ 
duplex constraint on the actions of energy harvesting and 
transmission. To improve the reliability, there have been recent 
efforts on analyzing packet retransmission mechanisms for EH 
communication systems M, and ref. ifTTll studies an AQR- 
based retransmission mechanism and applies Markov Decision 
Processes (MDP) to obtain the optimal power control policy. 

However, most of the previous works do not consider the 
existence of EH receiver, while the energy consumption for 
receiving cannot be ignored in applications like wireless sensor 
networks (WSN) with short transmission distances lfT8ll - ll20l . 
This motivates us to investigate the wireless link consisting 
of EH transmitter and receiver, with circuit blocks, as well as 
the decoding modules ifTSl taken into account. It is noted that 
some works deal with two-hop transmissions with EH source 
and relays a ED ED, but the energy harvested at the relay 
is used for the second hop transmission, not for receiving. 
Among the few, ref. considers receiving power for multi¬ 
hop WSN with EH, and efficient routing and node sleeping 
algorithms are proposed. With another line of research, energy 
cooperation with wireless powering is considered ll23l ll24ll . 
where the receiver is powered by the energy transmission from 
the transmitter. The main difference is that in their cases, the 
transmitter intentionally delivers the energy on demand to the 
receiver, while in our case, both sides may have no knowledge 
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of the energy condition of the other. 

In this paper, we focus on the outage minimization problem 
of a fading wireless link with EH transmitter and receiver, 
under an acknowledgement (ACK) based retransmission mech¬ 
anism. Only channel distribution information (CDI) is assumed 
known at the transmitter. Different levels of the battery state in¬ 
formation (BSI) sharing between the two nodes are considered. 
As both nodes are EH-based, it is also nature to investigate 
the correlations between the energy arrival processes at both 
sides. Different from ED, where static channel is considered 
and the goal is capacity maximization, we put more focuses on 
the outage performance of EH links, considering the two-fold 
randomness from EH and channel fading. The main objective 
of our research is to find the optimal power control policy, and 
to investigate how the energy arrival correlation, the battery 
capacity and different receiver structures can impact the outage 
performance of a link. The main contributions include: 

1) With infinite battery capacity and no retransmission, we 
prove that threshold-based power control is optimal when 
the EH process is independent between the transmitter and 
receiver. The optimality holds regardless of whether BSI is 
shared or not. We thus provide the closed-form expressions for 
the optimal power thresholds of the disjoint policy without BSI 
sharing, and for the joint policy with BSI sharing, respectively. 
We further investigate the impact of EH correlation between 
the two nodes, and show that the joint threshold-based policy 
is still optimal. 

2) Based on queuing theory, we provide a simple yet 
powerful framework of deriving the transmission probability, 
and the receiving probability of EH links. Eor the transmission 
probability, the assumption on the i.i.d. EH process in El 
is generalized to stationary and ergodic EH process. The 
derived transmission and receiving probability further enable 
directly proving the optimality of the threshold-based policies, 
which extends the results in 0 to the case when both the 
transmitter and receiver are EH-based, with the consideration 
of EH process correlation between the two sides, the circuit 
power, the receiver detection and processing, different levels 
of BSI and CSI sharing. We also investigate the case when the 
receiver knows the policy of the transmitter, and show how this 
information helps improving the system performance. 

3) When retransmissions are taken into account, we 
show that the derived optimal power thresholds for non¬ 
retransmission case are still near-optimal. To further adapt 
the threshold per retransmission, a modified disjoint threshold- 
based policy called linear power levels policy is adopted, and 
the choice of the starting power level is investigated. 

4) Under finite battery capacity, the proposed policies are 
analyzed via a three dimensional finite state Markov chain 
(ESMC) model, which allows the calculation of the optimal 
power thresholds and evaluation of their performance. The 
numerical results via the ESMC analysis provide insights to 
the impact of the system parameters, especially the battery 
capacity and EH cotTelation between the transmitter and 
receiver, on the outage performance of an EH link. 

The remainder of this paper is organized as follows. Section 
ttn sets up the system model, and three power control policies 
are introduced in Section m In Section IV, we analyze and 


prove the optimality of the policies with infinite battery ca¬ 
pacity. We investigate our policies with finite battery capacity 
using ESMC in Section V, and we propose the cotTesponding 
local searching algorithm for the optimal thresholds. Numer¬ 
ical results are presented in Section VI, and are followed by 
our conclusion in Section VII. 

H. System Model and Problem Eormulation 

We consider a fading wireless link consisting of one trans¬ 
mitter (denoted by source node S) and one receiver (denoted 
by destination node D). Both nodes are powered by batteries 
of capacity Smax and are capable of harvesting energy from 
the surrounding environment. Discrete-time model is used, 
where the time axis is partitioned into slots denoted by 
t G {0,1, 2, • • ■ }. With unit slot length, we will use energy 
and power, and their corresponding units interchangeably. 


A. Channel Model 


We consider frequency non-selective block Rayleigh fading 
channel, where the channel h keeps constant within one time 
slot, and varies independently from slot to slot, and E{|ft,p} = 
1. There is only CDI at S, while D has CSI. The transmission 
rate R is assumed fixed. We assume unit bandwidth, and thus 
R is in the unit of spectrum efficiency. Channel outage occurs 
when the mutual information between S and D is less than 
R, where we assume one packet per slot. The corresponding 
channel outage probability Pr ||/ip < | in any slot t 

is l28l 


P{PL) = 


1 — exp 
1 , 


(2« - l)z\ 

PL ) ’ 


PL > 0 , 
PL = 0 , 


( 1 ) 


where Pj* is the transmission power at the S in slot t, and z 
denotes the power of additive white Gaussian noise. 


B. Energy Consumption Model 

With the consideration of circuit power, the power consump¬ 
tion at S in slot t is given by 

Ps = (1 + Q^)^tx + ^c,slp,(>oi (2) 

where Pc,s denotes the static power consumption of the circuit 
blocks, and (1 -f a)PL represents the power consumption for 
the power amplifier, where a > 0 mainly relates to the drain 
efficiency. The indicator function 1^ equals to 1 if event A 
is true, otherwise it equals to 0. Assume that if S is not 
transmitting, i.e., = 0, the circuit does not consume any 

power, i.e., Pg = 0. In practise, transceivers may consume 
some power even when it is not transmitting or receiving 
im, therefore this assumption implies that S can switch on 
and off the transceiver instantaneously, or the start-up time 
is negligible. Similar assumption is made for D. Note that 
at the beginning of each slot, D does not know whether 
S is transmitting or not in advance. Therefore if there is 
enough energy in the battery, D will be active, and its power 
consumption in slot t is given by 

ph = pd, 


(3) 
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where Pd includes two parts: (1) The power consumption of 
the receiving circuit, and the decoding module depending on 
whether the link is coded or uncoded 1201 : ( 2 ) the power for 
feeding back ACK to S, which will be described in Section 

ttED] 

To further improve the energy utilization efficiency at D, 
in Section IIV-CI we consider that D first spends ^ fraction 
of the energy ^Pd to detect whether there is an on-going 
transmission from S, where ^ G [0,1]. In this way, if P is 
active, the power consumption at P in slot t is 


(Fn, P|=0, 
Pd, P|>0, 


(4) 


where the first case means that S is not transmitting, and thus 
P will switch off after consuming ^Pd amount of detection 
energy; otherwise in the second case P will keep receiving for 
the whole slot when S is transmitting. The power consumption 
model can be further extended to include data processing 
power at P, which will also be discussed in Section IIV-CI 


C. Energy Harvesting Model and Battery State Evolution 
Denote the harvested energy in slot f at S' and P as 
{Pg},{P^} C K.>o, which are stationary and ergodic 
sequences over time. Their means are E [Eg] = Xs and 
E [P^] = Ad respectively. The EH processes can also be 
correlated between S and P, and the correlation coefficient 
is defined as p = ^vjjere cov(-) is the covariance 

function, (Td| represent the standard variances of 

Pg and P^. Let Bg and denote the battery energy at the 
beginning of slot t, of S and P respectively. The battery gets 
replenished whenever a node harvests energy at the end of the 
slot. The evolution of the battery states at S and P are 

= min {B^g -P*g + B^,^) , (5) 

= min (P ‘5 + P„,ax) , ( 6 ) 

where Pmax denotes the battery capacity. Note that the above 
equation implicitly requires Bg > Pg and P^ > Pd 
respectively, and these will be enforced by the power control 
policies, where we remark that S and P know their own 
battery state. Remark that the requirement P^ > Pd holds 
even for the case with receiver detection described in ©. 
This is because at the beginning of each slot, as P does not 
know whether S is transmitting or not, it has to ensure that 
P^ > Pd, to guarantee the reception for the whole slot in 
case S is transmitting. 


D. Packet Retransmission 

In slot t, the packet transmission may fail if: (1) S does 
not have sufficient energy to transmit the packet so that S set 
P(* = 0; (2) The packet is not transmitted successfully due 
to the channel outage with probability p(Pt(); (3) P does not 
have sufficient energy to receive the packet, i.e., P^ < Pd. 
Equivalently, the per-slot transmission successful probability 
(j) is given by 

(j) = Pr{no channel outagejP,* > 0}Pr{P(‘ > 0, P^ > Pd} 
= [l-p(P.‘)]Pr{P.‘ >0,P15 >Pd}, 

( 7 ) 


where the second equality comes from the independence 
between the battery states and the channel condition, and note 
p(P(*) is defined in ([T]i. 

The retransmission is based on an ACK-based mechanism, 
in which ACKs are fed back by P over an error free channel 
to S. If the transmission is successful, S will receive an ACK 
from P and move on to the next packet. If S does not receive 
an ACK or it does not transmit due to insufficient battery 
energy, it retries until it receives an ACK, or reaches the 
retry limit and moves on to the next packet. We assume: (1) 
The time for transmitting ACK is negligible; (2) The power 
consumed for sending ACK is included in Pd; (3) If channel 
outage occurs, even without feeding back ACK, the power 
consumption of P is still Pd. 

As we allow K — 1 times of retransmission (K times 
of transmission attempts in total), the retransmission state is 
denoted hy u GU, defined by 

{ — 1, ACK received, start a new packet 
0 , last packet in outage, start a new packet 
fc, fc-th retransmission, k G {1, • • • , K — 1}. 

( 8 ) 

With retransmissions, the final outage probability of a packet 
Pout is the probability that the packet can not be successfully 
transmitted within AT slots. Please note that with our definition, 
the “retransmission” includes the failed transmission attempts 
due to insufficient energy at S. 


E. Outage Minimization Problem 

This paper considers the online power control policies at S 
in order to minimize the packet outage probability pout^ 

nun Pout, (9) 

where V denotes the set of all online stationary policies, where 
in a policy S decides Pg, based on CDI, average energy arrival 
rate As, its BSI (or including Ad and B]^ at P depending on 
the level of BSI sharing) and the retransmission state u. In 
this paper we assume the data source of S is backlogged. 
Our analysis for the backlogged scenario is also valuable for 
cases with random packet arrivals, in terms of determining 
the stability condition of the data queue. Specifically, to keep 
the data queue stable, the bit arrival rate should be less than 
R{1 — Pout)- Even though the energy buffer and the data queue 
are coupled at S, this stability condition is proved in m 
by assuming S transmits dummy packets even when the data 
queue is empty. 


III. Power Control Policies 

In this section, three power control policies are presented. 
Their performance will be analyzed in later sections. Note that 
by default P does not know what policy S is taking, and we 
will discuss the impact of knowing the policy of S' at P in 
Section IIV-DI 
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A. Disjoint Threshold-based Policy 

In this policy, the possible values of Pg are chosen from a 
binary set and S only transmits when the energy stored in its 
own battery is larger than a threshold. Specifically, 


Ps = 


Ps, lfBs>Ps, 

0 , else, 


( 10 ) 


where according to (|2|, Ps > Pc,S is required, and the 
corresponding transmission power is Pj* = —The 
battery state evolution under this policy at S is given by 


— min(Pg — Pgl^t +p 5 ,Pmax)- (11) 

The optimization variable of the policy is to determine the 
best threshold Pg. Note that in this policy, each node does 
not know the BSI of the other. 


B. Joint Threshold-based Policy 

This is a genie-aided threshold-based policy that each node 
knows whether the other node will transmit/receive or not in 
current time slot. Under this policy, S transmits only when 
both Pg > Ps and Pp > Pp hold, and so does D to 
make receiving decision. The battery state evolution under this 
policy is 

P^^^ = min(P^ — P/3'^B*g>Ps,Bj,>PD + Bmax), (12) 

where (3 G {S, Dj. The optimization variable of the policy is 
the threshold P 5 . 


C. Linear Power Levels Policy 

The previous two policies do not take the retransmission 
states into account, while this policy is a modified version of 
the disjoint threshold-based policy with non-decreasing multi¬ 
level thresholds {Ps{u)}, u GU, i.e., 

Ps{u) = Ps{0) + ^ ■ u, u = 0,...,iT-l, (13) 

where A is the step value, and note Ps{—1) = Ps(0). The 
battery state evolution at S under this policy is given by 

Pg+^ = min(Pg — P 5 (u)lpt -I- Es^Bmax)- (14) 

The optimization variables of the policy are Ps(0) and A. 

IV. Infinite Battery Capacity 

In this section, we will analyze the outage probability when 
both nodes are equipped with infinite battery capacity. The 
optimality of the threshold-based policies is discussed. Unless 
otherwise specified, the theorems, lemmas, propositions and 
corollaries have the following default conditions: The battery 
capacity Pmax = + 00 . The energy arrivals {Pg} and (Pp) 
are stationary and ergodic processes, with E [Pg] = As and 
E [Pp] = Ap respectively. 


“Source” 


E, 


] 


T t 

Esource Ps 


“Queue” 




/ 



Tx WPc 


Packets 


Eig. 1. The “energy packet” queue. 


A. Disjoint Threshold-based Policy with K = 1 

When P = 1, there is no retransmission. According to 
the definition in 0 , we denote (j>{Ps) as the transmission 
successful probability in a slot, with threshold Ps. The policy 
design is to find the best threshold Ps so that (f>{Ps) is 
maximized, written as 


(PI) max (j){Ps) — exp 
Ps>0 




T'(Ps,Pr3), (15) 


where recall that Px = (Ps — Pc,s) /(I + «)> and T'(Ps, Pd) 
is the probability that S has enough energy to transmit and D 
has enough energy to receive, i.e.. 


'k(Ps,Pp)= lim — E 

n.—^-l-on n < ^ 




>Ps,B* >Pi3 


(16) 


To solve (PI), we need to calculate T'(P 5 ,Pp), before 
which two lemmas are proposed. 

Lemma 1: Denote the transmission probability of S as 

1 ” 

T's = lim — E 

n—¥-\-oo Tl 

Under the disjoint threshold-based policy, we have 

T-S = min ^1, . (18) 

Proof: Without loss of generality, assume initially the 
battery is empty, i.e., Pg = 0. The proof is based on 
constructing a equivalent energy “packet” queue Q reflecting 
the energy arrival and usage process. The queue operates in a 
slotted manner. At the end of each slot t, the arrived energy 
is firstly stored in a “packet source”, of which the energy at 
the beginning of a slot is Right at the energy arrival, 

the “source” generates energy “packets”, each with volume 
Ps, into Q at the end the slot. Correspondingly the number 
of generated packets is n* = [ J ([.J is the floor 

operator, and ut G Z>o), and then the amount of energy 

BsJurce = PLurce ~ ^tPs + Es (19) 

remains in the “source”, and here we always have < 

Ps- Note that {rit} solely evolves with the stationary and 
ergodic sequence {Pg}, and thus is also stationary and er¬ 
godic. The generated packets are then buffered in Q with 
infinite capacity. The constructed queue is illustrated in Fig. (Tj 
The “energy packet” arrival rate of Q, defined as the average 


Lb‘>P s 


(17) 
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arrived packets per slot, is 


'^ep — 


(a) 


(b) 


lim 

n—>-+oo 

-p— lim 

Pg n—>-+o 

As 

Ps’ 


ELiEN 




( 20 ) 


where (a) holds by adding the two sides of ( fTOl l from t = 
1 to t = n and then taking expectations, with initial values 
^source = = 0 ; (b) holds because < Ps- 

The “server” of Q serves one packet per slot if the queue 
is not empty, i.e., the service rate is /iep = 1 , this corresponds 
to the policy that S transmits if Bg > Ps- Therefore, denoted 
by Nt the number of energy packets in Q at the beginning of 
time slot t, its evolution is 


Nt+i — Nt — lwt>i + n-t, ( 21 ) 


Comparing with (|5ll and noting that we have infinite battery 
here, it can be easily checked that 


Therefore, 


Nt = 


Ps 


lim — E 

—v-l-rio m ^ 


n—>+oo 7T, 


'-B%>Ps 


lim -X]E[lAr,>i] 

-^4-00 n 


n^+cxD n 




( 22 ) 


(23) 


where the last equality holds due to (l22ll . For '^ 5 , First 
consider Aep > 1 , and we prove T's = 1 by contradiction. 
Assume '^5 < 1, and thus for some subsequence of {iVt}, 
denote slot sequence {ti} C N as those satisfy = 0, 
and ti < t 2 < ■ ■ ■, where the sequence {ti] must be infinite 
(and thus {iVt} is stationary and ergodic), otherwise T'g = 1 
immediately. Note Vfi > 1, we must have = 0, and 


t — —2 

^ ^ p-t ~ ft+i 

t=ti 


-L-l, 

if fz + l > + 1 ; 

(24) 

nti = 0 , 

if fz+i — ii I 5 

(25) 


where the above equations basically indicate that between two 
adjacent slots with empty queue Q, the number of arrived 
packets equals to the number of slots between these two slots, 
because the queue evolves from empty to empty, and each slot 
exactly consumes one packet. Then 

lijn ^*=1 = lijn ' Et=l lA''t+i=0 + Et=l lA^t+i>0 

n—77, n—¥^oo 77, 

= a.s., (26) 

while lim„^+oo = Ep almost surely as {n*} is 

stationary and ergodic, which means Aep < 1 as we assume 
^*5 < E which contradicts the fact that Agp > 1. Therefore 
T-s = 1 . 


When Aep < 1, according to Little’s Law |[30], 

lim„_j,+oo ^ E"=i E [lArt>i] is the server busy probability of 
Q, and equals to the load of the queue, i.e.. 


Mep 


As 

Ps' 


(27) 


where the equality holds due to (l20l) and jiep = 1- The proof 
is completed. ■ 

Note that Lemma [T] coincides with 0 Theorem 1], while 
we provide an alternative proof based on queuing theory, and 
the i.i.d. assumption of {E^} is relaxed to be stationary and 
ergodic. This method further allows us to prove other results 
in this paper with more complicated situations especially for 
D. 

Lemma 2: Define the receiving probability of D as 


T* n = lim 


hoo n 


n 


B*>Pd 


(28) 


Under the disjoint threshold-based policy, we have 


'!'£) = min 



(29) 


Proof: Follow the same procedure of proving Lemma [T] 


We then have the following Theorem for the simultaneous 
transmission and receiving probability T'(P 5 ,Pd) when the 
energy arrivals at S and D are mutually independent, i.e., 

p = 0. 

Theorem 1: When EH processes of S and D are mutually 
independent, and BSI is not shared. We have 'S{Ps,Pd) under 
the disjoint threshold-based policy given by 


'^>{Ps,Pd) 


min 


^ ^ ^ AgAi) A 
’ Ps' Pd' PsPdJ 


(30) 


Proof: Since {Eg} and {Ef)} are mutually independent, 
and also because S and D do not know the BSI of each other, 
the events B^g > Ps and B{j > Po are independent. We have 


T'(Es,Pd) = (31) 


and according to Lemma [T] and Lemma |2] we directly obtain 


T'(Es,Pd) 


min 





= min 


, ^ ^ AgAj) A 

' Ps' Pd' PsPdJ ' 


(32) 


which completes the proof. ■ 

When energy arrivals {El} and {Ej^} are not independent, 
i.e., p ^ Co Theorem [T] still holds if max(T' 5 , vki)) = 
1. In this case. Theorem [T] becomes T'(P 5 ,Pd) = 

min^l,^,^^. Otherwise, if max('I's, T'^i) < 1 which 
means that energy arrivals are insufficient at both nodes, it 
is expected that 'I'(Ps,Pd) increases with p, which will 
be confirmed by our numerical results. But this will highly 
depend on the energy arrival profiles at both nodes, and no 
closed-form 'i’{Pg,P£)) is available here. 


*The energy arrival correlation p can take negative values under some 
circumstances, while generally it is positive as for wind and solar energy. 
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Next we will show that the disjoint threshold-based policy 
is actually the optimal power control policy under independent 
energy arrivals. First two lemmas are needed as follows. 

Lemma 3: Consider a non-EH block fading channel with 
only CDI at S. Given the average power constraint E{Pg} < 
Pq at S, the optimal power control policy that minimizes the 
average channel outage probability, is threshold-based, i.e.. 


where 


Pi = 


0 , 

P*, 


with probability 1 


p* ’ 


with probability 




^ 0 , PO > Pa, 

Pa, Po < Pa, 


(33) 


(34) 


where Pa is the larger solution to the quadratic equation (P — 
Pc.s? = il + a){2^-l)zP. 

Proof: See Appendix lAl ■ 

We then immediately have the following lemma. 

Lemma 4: Consider a block fading channel with non-EH D 
and EH S with stationary and ergodic energy arrivals {Eg}, 
and E [Pg] = As. Assuming only CDI at S, the optimal power 
control policy that minimizes the average channel outage 
probability is threshold-based, the same as the corresponding 
non-EH link with average power consumption at S as As. 

Proof: Let Pa be the same notation as in Lemma |3] 
If P Pa, then P* = As. Otherwise, let the policy be 
transmitting whenever Bg > Pa, according to Lemma [T] the 
transmission probability is ^ks = ^. This policy is the same 
as the optimal policy in Lemma|3]by replacing Pq with As. As 
EH imposes more strict power allocation causality constraints, 
the performance of the EH link is no better than the non-EH 
link with the same average power consumption. Therefore, this 
policy is optimal for the EH link. ■ 

The above lemma coincides with ||5] Proposition 3.5], while 
we consider the circuit power Pc,s of S, and we also provide 
a direct proof for inhnity time horizon without taking the 
asymptotic to the solution of the hnite time horizon case. Now 
we are ready to prove the optimality of the disjoint threshold- 
based policy. 

Theorem 2: When EH processes of S and D are mutually 
independent, and BSI is not shared. Assume no retransmission. 
Among all stationary power control policies, the disjoint 
threshold-based policy is optimal, with optimal threshold Pg 
as 

P 5 = max(As,Pth), (35) 


where Pth is given by 


Pth 


(2Pc,s + c) -I- c 2 (4Pc,s + c) 2 , 


and c = ( 2 ^ — 1)(1 + a)z. 

Proof: See Appendix iBl ■ 

The theorem also reveals that the optimal threshold does not 
depend on \j^, and thus the optimality holds even when energy 
arrival statistics are not shared between S and D. 


B. Joint Threshold-based Policy with K=1 

Under the joint threshold-based policy, S and D know 
whether the other node will transmit or not, and the battery 
state evolution is shown in (Ell. Under this policy, we have 
the following proposition for the probability T'(Ps,Pd). 

Proposition 1: The probability '^{Ps,Po) under the joint 
threshold-based policy is 

4'(Ps, Pd) = min ^1, (36) 

Proof: See Appendix ICl ■ 

Using above Proposition, we have the following theorem. 

Theorem 3: Assuming no retransmission, and S and D 
know whether the other node will transmit or not. Among 
all stationary power control policies, the joint threshold-based 
policy is optimal. The optimal threshold Pg is 

Pg = max ^As, Pth,’ (37) 

where P* is the same as in Theorem |2l 

Proof: See Appendix iDl ■ 

Remark that from Proposition [1] and Theorem [3l the joint 
threshold-based policy is not affected by the correlation be¬ 
tween {Eg} and {P^}, and its performance and the optimal 
transmission power threshold are the same regardless of the 
value of p. 


C. Receiver Detection and Processing 

This section will provide some extensions to the receiver 
power consumption model. We will first demonstrate a more 
practical detection structure, and then the energy for process¬ 
ing the received data will be included in the model. 

As explained in Section Hl-Bl D can spend ^ portion of Pd 
to detect whether S is transmitting, rather than receiving for 
the whole slot. Note that when ■C = 1, node D has to receive 
for the whole packet to decide whether S is transmitting or 
not, which corresponds to the model we discussed previously. 
Under disjoint threshold-based policy, we have the following 
proposition with receiver detection. 

Proposition 2: Dehne the detection and receiving probabil¬ 
ity of D as 


T'd = lim 


■Ee 

t=l 


'-B\.>Pd 


(38) 


Under the disjoint threshold-based policy, we have 

where ^ is the detection cost, and 'ks = min ^ 1 , 

Proof: See Appendix IE] ■ 

Prom (l39b . we can see that 'kD < which is the receiving 
probability without detection, and this means that the detection 
increases the receiving probability. If T's < 1, this gain 
increases when ^ decreases. Note that now the events 1 d^>Ps 
and 1 d^>Pp are no longer independent, since the energy 
consumption procedure at D is affected by how energy is 
used at S. Therefore the probability T'(P 5 ,Pd) can not 
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Fig. 2. Simulation results for ^(P^, Po) with detection cost ^ and process¬ 
ing, compared with ^'(1, d)^ for Ps = 700mW, Pd = 600mW, 

2 = lOOmW, p = 0, = + 00 , and \g = Xd = 500mW. Here 

^5 = and is according to lOD for detection only with Pd = Pp^ 
according to SD for detection and processing with Pp > Pp, respectively. 


be calculated by (l32T i. But from simulations, as shown in 
Fig. |2] when 4's < 1 and < 1, '^{Ps, Pd) can be 
well approximated by for detection only. As a result, 

following the same procedure as in the proof of Theorem |2] 
one can still get the threshold Ps that is close to optimal. 
For joint threshold-based policy, the detection is actually not 
necessary, or at the optimal operation point, the joint threshold- 
based policy is equivalent to the case with ^ = 0. For the linear 
power levels policy, the optimal performance with receiver 
detection can be solved by FSMC decribed in the next section. 

Moreover, for example in sensor networks, D may use more 
power to process the successfully received data in addition 
to Pd- For instance, D may need to compress and store the 
sensed data from S, or relay the data to the next hop, etc. If the 
precessing is performed immediately after the data is received, 
or concurrently with receivingll, we can denote the total power 
consumption as Pp, and correspondingly Pd = rjPF, where 
T] G (0,1]. In this way, at the beginning of a slot, the battery 
energy should satisfy > Pp, because the battery should 
support D to receive and process the data, otherwise there is 
no need to receive. The power consumption at D in slot t is 

r ipPp, Pi = 0, 

/ rjPp, PI > 0 and channel outage occurs, (40) 

[ Pp, else. 

where the first case means that S is not transmitting, and thus 
D will switch off after consuming ^Pd = ^riPp for detection; 
In the second case when S is transmitting, D will keep 
receiving for the whole slot but the channel encounters outage, 
so that only the receiving power Pd = pPp is consumed and 
no further processing is needed; Finally in the third case, D 
successfully receives the data and completes the processing. 

With receiver detection and processing, the probability T 'd 
is the probability that > Pp. We have the following 

^Basically it requires that the processing is completed within the same slot. 


proposition for tk d with receiver detection and processing. 


Proposition 3: Define the detection and receiving probabil¬ 
ity of D as 


1 

4/ £) = lim — E 

n—J' + OO Jl 


Lb* >Pf 


(41) 


Under the disjoint threshold-based policy and with receiver 
detection and processing, we have 


D = min I 1 




[1 - (1 -Ct?) (1 - 'ks) -[i-v)p{Ws\Pf 


(42) 

where ^ and p are the detection cost and the receiving fraction 
respectively, and 4*5 = min{l, ^}, and p(Ptx) is the channel 


Ps — Pc,s 


outage probability defined in ([T]), with Ptx = ——■ 

Proof: See Appendix |F1 ■ 

Since the dependence between the events and 

^b\,>Pd introduced due to the receiver detection and pro¬ 
cessing, the probability '^{Ps,Pd) can not be calculated by 
(l32l i. As shown in Fig.|2] we can still approximate 4'(P5, Pd) 
with (l32l i. but the precision becomes worse when we introduce 
processing, i.e., Pp > Pp- 


D. Policy of S Known at D 

In previous subsections, D has no knowledge about whether 
S is taking the threshold-based policy or not. But for example, 
if D knows that S is adopting the threshold-based policy with 
threshold Ps, since D has CSI, when the channel can not meet 
the transmit rate R, D can choose not to receive. In this way, 
when B\) > Pp, 


pt _ 
^D — 


0 , \h\^< 
Pd, else, 


2 . (2'' - 1)^ 


Ptx 


(43) 


where Ptx = —We have the following results for 
disjoint/joint threshold-based policies. 

For the disjoint policy, the following corollary of Theorem 
1 holds for the probability T'(Ps,Pd)- 

Corollary 1: Assume EH processes of S and D are mutu¬ 
ally independent, and BSI is not shared, but D knows that S is 
taking the threshold-based policy with threshold Pp- We have 
Lk(Ps, Pd) under the disjoint threshold-based policy given by 


T'(Ps,Pb) = 
As 


1 , 


Ab 


AsAb 


Ps’ PB[l-p(Ptx)]’ PsPB[l-p(Ptx)] 


(44) 


where p(Ptx) is the channel outage probability defined in ([T]|, 
with Ptx = 

Proof: See Appendix iGl ■ 

Comparing Corollary [T] and Theorem [T] we can see that 
knowing the policy of S' at P improves the receiving prob¬ 
ability so that the outage performance is improved. We can 
further prove the optimality of disjoint threshold-based policy 
as the following corollary of Theorem |2l 

Corollary 2: Assume EH processes of S and D are mu¬ 
tually independent, and BSI is not shared, but D knows the 
















policy of S. Assuming no retransmission, among all stationary 
power control policies, the disjoint threshold-based policy is 
optimal. The optimal threshold Pg is the same as given 
in Theorem 12] 

Proof: See Appendix iHl ■ 

The above argument indicates that, the policy information 
available at D improves the outage performance, but it has no 
impact on the optimal policy at S, when the BSI is not shared. 
But the conclusion does not hold when the BSI is shared. 

For the joint policy, S knows whether D will receive or not, 
and thus when D knows its policy, which means that the BSI 
is shared and S also gets the CSI indirectly. As a result, there 
are full BSI and CSI sharing between S and D. We have the 
following corollary for '^{Ps,Pd), and the proof is simply 
based on the proof of Corollary [T] and Proposition [T] 

Corollary 3: Assume D knows that S is taking the 
threshold-based policy with threshold Ps, the probability 
T'jPs, Po) under the joint threshold-based policy is 


T'(Ps,Pd) = min 1 


As 


A d 


[l-p(Ptx)]Ps’ [1-p{P.)\Pd 


(45) 



1+a 

Based on this corollary, we can easily get the optimal threshold 
Pg for the joint policy when the CSI and BSI are shared. 


Corollary 4: When D knows that S is taking the threshold- 
based policy with threshold Ps, the optimal threshold Pg of 
the joint threshold-based policy satisfies 


exp 


{2^-l){l + a)z 

n - Pc,s . 


As 

p* ’ 


(46) 


where Pg > Pc,s- 

Remark that when both CSI and BSI are shared, the 
threshold-based policy is no longer optimal, while the trun¬ 
cated channel inversion ll29l can be proved optimal, which in 
our case has no closed-form expression, and is not presented 
in detail for the conciseness of this paper. 

As for the receiver detection and processing, first note that 
in Proposition |3] D does not know the policy of S. If D knows 
the policy of S, Proposition [3] has the following corollary, of 
which the proof is simply based on the proof of Corollary [T] 
and Proposition [3] 

Corollary 5: Under the disjoint threshold-based policy with 
receiver detection and processing, if D knows the policy of 
S, we have 


T'n = min | 1, 7-^---;--7- | , 

(47) 

where ^ and p are the detection cost and the receiving fraction 
respectively, and T's = min{l, and p(Ptx) is the channel 
outage probability defined in ([T]l, with Ptx = . 


E. K > 1 and Linear Power Levels Policy 

When retransmission is allowed, i.e., K > 1, the anal¬ 
ysis becomes much more complicated. Mainly because the 
transmission probability 'k will be correlated among adjacent 
slots due to the existence of battery. Even if we constrain the 



Fig. 3. Average outage probability versus energy arrival rate for R = 
2bit/Hz/s, Po = 700mW, 2 = lOOmW, p = K = A, = + 00 , and 

other parameters according to Section I VII U nder this setting, Sth = 787mW, 
and Pg is determined by (ID and (ID for disjoint and joint policies, 
respectively. 


threshold-based policy to be invariant among retransmission 
states, the optimal threshold may be different from our previ¬ 
ously derived Pg. Nevertheless, the derived Pg is near-optimal 
with AT > 1. To illustrate this, consider an equivalent non-EH 
fading channel with average power constraint As and Ad for S 
and D, respectively. Eollowing Lemma [3| the optimal policy 
to minimize the per-slot outage is threshold-based. Like the 
disjoint policy, D has no choice but to receive with probability 
min(l, if it does not know the action at S. Eor this fading 
channel, threshold Pg in (l35T l is the optimal threshold policy 
among all policies being invariant over retransmissions. The 
outage probability is then 

pTHPs) = - HPs))^, (48) 

where 4>{Pg) is the same as the definition in (ffSl l. and the 
above equation holds since the successful transmissions among 
adjacent slots are independent for non-EH fading channel. This 
serves as the lower bound of the outage probability for the 
counterpart EH link. Therefore, using the same threshold Pg, 
the outage probability of the EH link satisfy 

Pout{P*s) > p:T^{P*s) = (1 - ^{P*s))'^- (49) 

Eor joint threshold-based policy, the same conclusion holds. 
To show that the derived Pg is close to optimal, we simulate 
the outage probability as shown in Eig. [3] from which we can 
see that Pout(-P5) is very close to its lower bound P^ouPiPs)’ 
for both joint and disjoint policies. 

Different from the other two, the linear power levels policy 
considers the retransmission state u. Given the intervals of the 
sequence A and the transmission limit K, we need to find 
the optimal starting power level Pgi^. Since closed-form pout 
is not available, by now we can only heuristically choose the 
starting power. Eor the counterpart non-EH fading channel, if 
linear power levels policy is applied, to keep the average power 
consumption the same, the starting power E!s,ip < Pg. Using 
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this heuristic, in Fig. |3] we set Ps,\p = 0.8Pg to the EH case, 
and set A = lOOmW for As < 700mW, and A = 200mW 
for As > 700mW. We observe that linear policy outperforms 
disjoint policy under medium to high energy arrival rates, and 
its gain increases when energy becomes more sufficienj^. This 
phenomena will also be discussed in Section IVll where we can 
use FSMC to look for the optimal Pg jp. 

V. Finite Battery Capacity 

In this section, we analyze the performance and parameter 
optimization for the three policies where both nodes are 
equipped with finite batteries with capacity i?niax < +c>o. 
We implement our model using discrete-time FSMC, in which 
we quantize all energies with respect to a energy fraction 
E 0 . Accordingly, the battery levels are considered to be 
integers multiple of E. Here we consider temporal independent 
Bernoulli distributed energy arrivals at S and D, but similar 
analysis can be extended to other types of distributions, which 
will be discussed at the end of Section IV-AI 


A. FSMC Formulation 

The discrete-time FSMC has the state space S = BsxBdx 
U, where S/3 = {0,1, • • • , Smax} iP € {S, D}) is the set of 
battery states of node /3. Recall that U = {—1, 0,1, • • • , AT—1} 
is the set of possible packet transmission attempt states. Thus, 
the state of the link at time t is denoted by s* = {bg, 
where € Bp and u* € lA. 

For the energy arrival processes, a Bernoulli model 
is considered. At the end of slot t, E'g^^ and 
^max jgygjs Qf energy are injected into the transmitter and 
the receiver with probability ns and no, respectively. With 
probability (1 — ns) and (1 — no), no energy is harvested. 
For example, with ns = IJ-D — their mean values are 
Xs = E[i?g] = ^E'g^^ and Xjj = Similar 

model is also used in Qt) for wireless sensor networks (WSN). 
We also consider the correlation between the EH processes of 
S and D with correlation coefficient p. 

Therefore, the harvested energy pair in slot t can be written 
as 


{Eg, E^j) 


( 0 , 0 ), 

(0,Er’'), 

(Er",0), 

( TTimax TTTnaxA 


with probability /tq, 
with probability ni, 
with probability n 2 , 
with probability n 3 , 


where t = 0,Each of the probabilities = 
0,1,2,3) can be calculated depending on ns, and p 
with the following constraints: (1) = 1 ™d (2) 

Ml = M2, where the second constraint comes from our setting 
of changing to control Xp, /3 G {S', E}. 

Denote the state transition probability matrix as T, whose 
elements represent the probability of a transition from state 


= {m,n,v) to state = {i,j,u), i.e., 

vyy. = pr = b =j, = u\ 

Bg = m, Bjj = n,u^ = v} , 


(51) 


where m,i G Bs, n,j G Bd and v,u G U. Fet 
<5(s*, a*(M), denote the Kronecker delta functioijl from 
state s* to with an action a*{u) = [PsiFP^PiA taken in 
slot t, with retransmission state u, and 


<5(s*,a*(u),s‘+i) = 


1, s* transits to with a*{u), 
0, else. 


For the ease of exposition, in the following we drop the node 
indexes S and D to consider one of the two nodes, e.g., 
we use n to denote the EH probability at each slot. For Vu 
and u = -1, = m( 1-p(Ptx))i5(s*,a‘(u),s*+^) if 

a transmission successes. For v = K — 1 and u = 0, or 

V = —1 and u = 1, or u 3A — 1,E — 1 and u = v + 1, 
Tpt = np{Ptx)5 {s^,a^{u),s*~^^) if a transmission fails 
because an outage occurs. For v = K — 1 and u = 0, or 

V = —1 and u = 1, or u 3A — 1,E — 1 and u = v + 1, 

Tft = (s*, a*(M), if a transmission fails due to 

insufficient energy states. Note in above explanations, the 
probability P^) is reflected in i5(s*, aX{u), s*“*"^), where 

the action a*{u) = [Eg(it),E{)] varies according to different 
power control policies as proposed previously. Specifically, 
for disjoint/joint threshold-based policy, Pg does not depend 
on u. For D, when the receiver detection is considered, P^ 
is decided by (|4|i. When D knows the policy of S, P^ is 
decided by (l43T l. and takes zero value with probability 
p{Ptx) and this must coincide with the calculation of Tft 
When both detection and processing are considered, P^) is 
decided by (l40l i with channel outage probability p(Etx), which 
should also coincide with the calculation of Tft . Note that 
^ = 1 and T] = 1 represent the cases without receiver detection 
or processing. 

The stationary probabilities 7r(s) of this FSMC can be 
obtained by solving the balance equatior0. 


7^(s) = ^Pr{s*+^ = {i,j,u)\s* = {m,n,v)}Tr{s). (52) 

S 


Under the ACK-based retransmission mechanism, an outage 
occurs if and only if a packet is still not transmitted suc¬ 
cessfully even after E — 1 times of retransmission. We focus 
on the following problem of minimizing the average outage 
probability 

min Pout, (53) 

Ps>Pc,s 


where the optimization variable Ps is the power threshold 
for the disjoint/joint threshold-based policies and is the power 
levels threshold for the first attempt in the linear power policy, 
and the step A is given. The average outage probability po^t 
is decided in the following proposition. 


^Its performance can be better than the non-EH fading channel outage lower 
bound of joint/disoint policies as these policies ai‘e retransmission-invariant. 

"^We use the unit Joule in our discretized model with the unit slot length, 
where the energy and power are quantized by the same value E joule. 


^Without ambiguity with the Dirac delta function used in Appendix m as 
here three parameters are used. 

^Under mild conditions on the transmission power policy, we can show that 
the FSMC is in'educible and positive recun'ent, which is not detailed here. 
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Proposition 4: The average outage probability is given by 


Pout — 


7r„=o(s) + 7r„=_i(s) ’ 


(54) 


where 7r(s) is the stationary probability that the link is in state 
s = {bs,bD,u), and 7ru=k(s) is defined as 


T^n=k(s) = y] TT{bs,bD,U = k). 


Proof: Notice that 7ru=o(s) represents the probability 
that a transmission outage occurs in the previous slot, and 
7 ru=-i(s) represents the probability that a successful transmis¬ 
sion is finished in the previous slot. Otherwise, in the states s 
where u — 1 , 0 , the packet will be under retransmission in 
the current slot. Assuming we have N slots and N +oo to 
reach the stationary distribution, then the number of packets 
transmitted successfully is A"success = and the 

number of packets transmitted with an outage is A^outage = 
lV7ru=o(s). Thus, the average outage probability is given by 
the ratio of the packets that are not transmitted successfully 
even after K —1 times of retransmission: 


Pout — 




outage 




outage 




(55) 


and we get the proposition. ■ 

Similarly, the average transmission times r is given by 
a weighed function of the stationary probability 7 r(s). First 
define. 


Pu,v= ^ Pr{s*+1 = = (m,n,u)} , 

and then 

^^l+ ^k=i'kp-i.k7rv=k(s) ^ (56) 

7r„=_i(s) 

We can now summarize the procedure for computing the 
outage probability pout and the average transmission times r: 

1) Discretize the battery levels and the power consumption 
levels to generate the FSMC with state space S, 

2) Compute T, where T is the state transition probability 
matrix with entries defined in dSD, 

3) Obtain the stationary probabilities 7r(s) by solving (l5^ . 

4) Obtain Pout and r from (l54l i and (l56l l. respectively. 
Remark 1: In the infinite battery capacity case where 

Umax = + 00 , it is obtained that if As > Ps and Ad > Pd, 
the energy is always enough, and the battery states will 
have the behaviors Pp = +c» (/? € {S',!?}). Thus the state 
space will become a simpler set denoted as Sjj = U = 
{ —1,0,1,- -!T—1}. The corresponding transition matrix 7u 
is of dimension {K + 1) and the stationary distribution can be 
easily obtained as 

ttd =-p—(57) 
1 — 

where p = p(Ptx)- Through ( |57] |. pout = P^ (which means K 
times of transmission fail continuously) where p is obtained 
through (4). However, for the general value of Pmax, it is hard 
to get the closed-form of the stationary distribution 7r(s). 


Algorithm 1 One Dimensional Searching Algorithm 
1 
2 

3 

4 

5 

6 

7 

8 
9 

10 
11 


Initialize P 5 = 0 and pout = 1; 
for Ps = Pc,s, Ps e Ps do 

Calculate the transition matrix T via (fSTl l and determine 
the stationary distribution tt via (l52] l: 

Obtain the outage probability pout via (El; 
tf Pout ^ Pout then 

Pout Pouti 


P*s^ 

D* _ 

-f^tx — 


end if 

Ps = Ps 

end for 


Ps-, 

Ps—Pc,S . 

l+a ’ 


E-, 


Remark 2: Although our analytical results in Section |IV] 
hold for general stationary and ergodic EH process, we restrict 
our FSMC analysis to i.i.d. EH sequences. Because the main 
focus is on the correlation between the EH processes of S 
and D. According to the results in Section IIVI the temporal 
correlation of the EH process will not have notable impact on 
the outage performance when the battery capacity is large. 
Nevertheless, temporal correlation may have severe impact 
on the outage performance with small battery. In fact, our 
ESMC formulation can be extended to tackle the temporal 
correlated EH process. Specifically, the current system state S 
has to be expanded to be five-dimensional, including Eg and 
Ejj as additional state attributes. This however imposes high 
calculation complexity, which should be addressed in future 
studies. 

B. Local Searching Algorithm 

Eor the disjoint threshold-based policy and the joint 
threshold-based policy requires carefully choosing the optimal 
threshold Pg € Bs- Eor the linear power levels policy, 
one also needs to find the optimal starting point Pg jp = 
Ps{—1) = Ps{0), for given K and the interval A 0. Since 
it is not possible to directly get the closed-form solution 
of Pout, a one-dimension local searching algorithm shown 
in Algorithm 1 is implemented to find the optimal Pg and 
the corresponding P^x- This algorithm requires searching all 
the states of the energy storage at the transmitter and is of 
complexity 0(Pmax + !)■ We also have the transition matrix 
T € R(Smax+i)^x(i<'-(-i)^ jjjg detailed T will be different 
according to different power control policies. 

VI. Numerical Results 

In this section, numerical results are presented to show the 
performance of the proposed power control policies. To use 
ESMC, the state space is discretized with the granularity of 
E = 50mW, and thus the energy and power are of the same 
quantization granularity. We consider temporal independent 
Bernoulli energy arrival with parameters Pmax = 20P and 
As = Ad = 500mW= lOP, and Pc,s = 100mW= 2P, 
Pd = 700mW= 14P corresponding to the coded system ifTSll 

^In the algorithm, note that E represents the discretized energy unit. 
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Fig. 4. Average outage probability with varying Smax and detection cost 
for R G {1, 2}bit/Hz/s, p = 0.5, K = A. 


II20I . The noise power z = lOOmW. We also set Pp = Pp, 
i.e., 77 = 1. Given the correlation coefficient p, we can obtain 
Pi{i = 0,1, 2,3) from (l50l l. Parameter a = 1 corresponding 
to the drain efficiency of 0.5 for a typical class-AB amplifier 
ll27l . and thus Pg = 2P(‘ + Pc,s- For the retransmission 
mechanism, unless otherwise mentioned, we allow K — A 
times of transmission (3 times of retransmission), and thus 
the retransmission state set U = { —1,0,1, 2, 3}. In all sets 
of results, the battery capacity is finite, so that the optimal 
power control policy parameters, i.e., Pg for joint/disjoint 
threshold-based policy, Pg^^ for linear power levels policy 
(with A = lOOmW) are decided by the FSMC-based local 
search in Algorithm 1. Remark that the linear policy under 
evaluation in this section is with ^ = 1. To evaluate the 
gain provided by knowing the policy of S at D, we provide 
corresponding results for joint threshold-based policies. In 
other policies, this setting is default, i.e., D does not know 
the policy of S. 

First in Fig. 0] the impact of battery capacity Bmax is 
observed. For all policies, the outage performance degrades 
when Bniax decreases. Because of the energy overflow with 
finite battery, the effective energy that can be used is reduced, 
resulting in more outages due to power shortage. With receiver 
detection, the energy at D is used more efficiently, and thus 
we observe less outage with smaller value of detection cost 
f. It is also shown that the linear policy is worse than the 
disjoint policy with small battery capacity. As the linear policy 
requires larger transmission power for retransmissions, small 
battery cannot provide enough energy, so that the transmission 
probability Ps{u) for retransmissions u > 1 is reduced, 
which degrades the performance. While for large battery, linear 
policy shows performance gain over disjoint policy, which 
coincides with our results for infinite battery in Fig. [3 

We then show the results of varying energy arrival correla¬ 
tion p in Fig. |5] It is observed that energy arrival correlation 
helps improve the outage performance. Because correlation 



Fig. 5. Average outage probability with varying p and detection cost for 
i? G {1, 2}bit/Hz/s, Bmax = 3As = 1500mJ, K = A. 


has the effect of “matching” the energy states of S and 
D, which can increase the probability that both nodes have 
enough energy to transmit and receive in the same slot. With 
infinite battery, as shown in Proposition [T] and Theorem [2 the 
performance of the joint policy is irrelevant to p, but under 
finite battery, its performance still improves with increasing p 
slightly. As the battery capacity is only Bmax = 3As, linear 
power levels policy performs the worst. 

In Fig. 0] and FigjS] we also provide curves for the perfor¬ 
mance of disjoint policy when D knows that S is taking the 
threshold-based policy. It turns out that knowing the policy of 
S at D lets disjoint policy perform close to joint policy, and 
even better than joint policy when R is small, and they are 
both insensitive to p. The reason of their similar performance 
is that if D knows the threshold of S, its energy is saved 
by avoiding unnecessary receiving when channel undergoes 
deep fading, while joint policy saves the energy of D by 
avoiding unnecessary receiving when S is not transmitting. 
We should remark that which of them performs better should 
depend on system parameters including channel statistics, rate 
requirement and EH profiles. 

To illustrate the procedure of searching for optimal threshold 
Pg with Algorithm 1, in Fig. |6] the outage performance with 
different values of Pg is shown. The optimal Pg for each 
policy is labeled with a star symbol. Since p = 0 and the 
battery capacity is relatively large, i.e., i^max = 6As, the 
optimal thresholds are compared with those derived under 
infinite battery and no retransmission. For infinite battery with 
R = 2bit/Hz/s, according to Theorem 121^5 = 787mW for 
disjoint and joint threshold-based policies. While in Fig. |6] 
due to the quantization granularity and the existence of re¬ 
transmission, the optimal Pg is SOOmW and 900mW for these 
two policies respectively. The case of disjoint policy is very 
close to our theoretical result, while it seems that joint policy 
is more sensitive to the existence of retransmission. With 
R = Ibit/Hz/s, similar observations are made. Please also note 
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Fig. 6 . Average outage probability with varying Ps for disjoint/joint 
threshold-based policies and PsH^) for linear power levels policy, for R G 
{1, 2}bit/Hz/s, Bmax = 6 As = SOOOmJ, ^ = 1, and p = 0, AT = 4. 



Fig. 7. The tradeoff between the minimum pout and the corresponding 
average transmission times r, with different maximum transmission times 
K G {2, 3, 4,5}, for R G {1, 2}bit/Hz/s, = 6 A 5 = 3000mJ, ^ = 1, 

p = 0.5. 


that the optimal starting power of the linear policy is smaller 
than Pg of the disjoint policy, which confirms our intuition 
in Section IIV-EI With large battery, the linear policy shows 
better performance than the disjoint policy, and the gain is 
more evident with smaller R. Fig. |6] also shows the results 
for disjoint policy when D knows the policy of S. Although 
the optimal outage probability is better than the default disjoint 
policy, the optimal threshold is the same, which coincides with 
Corollary |2] 

Finally, we show the tradeoff between the average outage 
probability pout and the average transmission times r in Fig. 
I2I We change the number of maximum transmission times 
K S {2,3,4, 5} and get different pairs of (r, pout). i-C-. the 
four points on each curve. When we set higher K, pout goes 


down. For K = 2, the linear policy shows better performance 
over the disjoint policy. We also try a smaller value of 
Pd — 100mW, corresponding to uncoded system iTTSl . In this 
case, the smaller Pd, together with the smaller R = Ibit/Hz/s, 
provides relatively sufficient energy availability, the linear 
policy can achieve the best outage probability, with the cost 
of larger delay. 

VII. Conclusion 

We have investigated the outage minimization problem 
for a fading wireless link with FH transmitter and receiver. 
Three power control policies are proposed, namely the disjoint 
threshold-based policy, joint threshold-based policy, and the 
linear power levels policy. For infinite battery and independent 
energy arrivals, we have proved that the two threshold-based 
policies are optimal among all retransmission invariant power 
control policies, with or without BSI sharing between S and 
D, respectively. Specifically for the joint threshold-based pol¬ 
icy, the optimal threshold and its performance are independent 
of the energy arrival correlation p between S and D. We 
also consider the receiver detection, receiver processing, and 
different levels of BSI and policy information sharing in 
our analysis. To analyze the performance with finite battery 
capacity, we formulate the FH link into an FSMC, with which 
the optimal power thresholds can be calculated. With extensive 
numerical tests via FSMC, we have shown the performance of 
the three policies under different values of the battery capacity 
Bmax and the energy arrival correlation p. It is found that 
larger correlation p actually improves the outage performance. 
We also show that knowing the policy of S' at D can enhance 
the performance of disjoint policy to be comparable to joint 
policy. Unlike the infinite battery case, the linear policy shows 
its performance gain over disjoint policy only with larger 
battery capacity or lower R. Finally, the tradeoff between the 
average outage probability and average retransmission times 
is presented. 

Possible extensions of this work are as follows. First, in 
this paper we consider Lid. block fading channel, while for 
Markovian channel, S can exploit the ACKs to estimate the 
channel gain and even learn the FH process of D. MDP based 
schemes can be promising candidates. Moreover, when the 
power consumption in the idle state (when the transceiver 
is not transmitting or receiving) and the switch-on/off delay 
are non-negligible, an appropriate node sleeping policy ED 
should be designed, and can be jointly optimized with the 
power control policy. 

Appendix A 
Proof of Femma[3] 

Denote the power control policy as a probability dense 
function (PDF) g{P), since S only has CDI. From ([T]l, write 
the per-slot transmission successful probability as the function 

of the total power consumption P: f{P) — e ^-^c,s for 
P > Pc,s, otherwise f{P) = 0. The objective is to find the 
optimal g{P) 

P + OO 

rnax / g{P)f{P)dP, 
g(P) Jo 


( 58 ) 
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subject to /q^°° Pg{P)dP = Pq- It is easy to check that f{P) 
is convex over [0, Pb] and concave over [Pb, +oo), where Pb = 
(,i+a)(2 -1) Furthermore, by solving = /'(Pfl 

we get Pa a One can check that Pa > Pb, and geometrically 
the line from (0,0) to [Pa, f{Pa)) is a tangent to f[P). We 
thus construct a function 


q[P) 


.f{Pa)P 
Pa ’ 

f{P), 


P&[0,Pa], 

P G [Pa, +CX)], 


(59) 


which is a continuous and concave function as Pa > Pb- The 
functions and points used in the proof are illustrated in Fig. 0 
Since f[P) < q[P), then \/g[P) 


^+oo 


g[P)f[P)dP < 


n-\-oo 


g[P)q[P)dP (60) 


Denote 

p + CXl 

p(P)=argmax/ p(P)g(P)dP, (61) 

a{P) Jo 

subject to Pg[P)dP = Pg. Since q{P) is concave, for 
the random variable P with PDF g{P), we have lISTl (3.5)] 


E{g(P)} < g(E{P}), 


i.e.. 


p + OO 


g[P)q[P)dP < q[Po), 


(62) 


and the equality holds when we set g[P) = S[Po), where 
5[x) is the Dirac delta function, which means that transmitter 
always transmits with Pg power. Let 


r S[Po), Pg > Pa, 

^ " I (1 - + ^S[Pa), Po < Pa- 

V J a -^a 

Note g*[P) satisfies /q^°° Pg*[P)dP = Pg. We then have 


^+oo |•-\-oa 

/ g*[P)fiP)dP^^ / g*[P)q{P)dP 
Jo Jo 

= /^“g(P)g(P)dP, 

Jo 


(64) 


where (a) holds due to the definition of q[P) and g*[P), and 
[b) holds since q[P) is linear in [0, P^j. Therefore, according 
to (l60l l and (IbTT i. g*[P) is the optimal solution to ( fSST l. 


Appendix B 
Proof of Theorem[2] 

For policies that do not depend on u, the objective is to 
maximize the per-slot successful probability </>. Since energy 
arrivals {Eg} and {P^} are independent, multiplying 
with /(P) (defined in Appendix |3l we get the successful 
probability of a transmission from S as /(P) = T'd/(P). As 
'ku is constant, /(P) keeps the properties of /(P) used in 
the proof of Lemma |3] and |4] and thus the optimal policy at 
S is threshold-based. 

*For P > Pc,S, as f(P) is differentiable on {Pc^s, +oo). 

^The larger solution to the quadratic equation {P—Pc s)^ = (1-I-Q!)(2^ — 
l)zP. 



Fig. 8. In the illustrative figure we set R = 0.5bit/Hz/s, a = 1, Pq g = 
200mW, 2 = lOOmW. 


To find the optimal threshold Pg, we need to solve (PI) 
in ( fTSi) . where '^{Pg,Po) is given in Theorem [1] Note c = 
(2 «-l)(l + a)z, Pjj, Xg and are all positive constants. 
We study the following cases. 

Case 1: As > Pg- In this case, ^[Ps,Pd) = 
min ^1, is independent of Pg. Thus 4>{Ps) = 

e( ^s-”(’c,s) ^![Pg,Pjj), and by obtaining the first-order 
derivative we have 


dc^[Ps) 

dPs 



c 

[Ps - Ps,cf 


> 0 


holds for VPs. This further yields that (j)[Ps) is an increasing 
function in its domain Ps G [0, As)\{Pc,s}- Obviously, Ps > 
Ps,c should hold. 

Case 2: As < Ps- In this case, 'I'(Ps,Pd) = 
min ^1, Solving — 0 leads to 


cPs = [Ps - Pc,s?. (65) 


By solving the above equation we have 

\ (2Pc.s + c)±c^ (4Pc,s + c) = , (66) 

and PjIj < and we can check that < Pc,s, which is 
not reasonable. So we define Pth = P*’ ™tl it is guaranteed 
that P2 > Pc.s. If As < P,h, then < 0. 

S Pg—Bih 

Thus Pg = Pth if As < Pth- Otherwise, if As > P*, ({’[Ps) 
decreases in [As,+oo), and hence Pg = As. 

From the two cases above, we finally have Pg = 
max (As, Pth)- The proof is completed. 


Appendix C 

Proof of Proposition[T] 

First, if max(^, ^) > 1, then following the proof of 
Lemma [T] one or both of the two nodes always have enough 
energy to transmit or receive. The problem degenerates to 
calculating the one side transmission or receiving probability. 
Then according to Lemma [T] and |2] we have 'I'(Ps,Pd) = 
min(T's, 4 'd) = min(l, ^). 

If max(^, j^) < 1, due to the constraint that the used 
energy should be less than the arrived energy, we have 

'^[Ps,Pd)Pi3 < A/3, 
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where (3 G {S^ D}, so that 

(67) 

Without loss of generality, assume ^ then 

'i>{Ps, Pd)Pd < Ad, which means D always have enough 
energy to receive. The problem degenerates to calculating 
the transmission probability at S, and thus from Lemma [T] 

’^{Ps.Pd) = ^. 

For the special case of ^ we must have 

T'(Ps,Pd) = ^ = Otherwise, according to ( l67T i. 
'^{Ps, Pd)Pp < A /3 (/3 G {5,77}), which means that there 
will always be sufficient energy at both sides for transmission 
and receiving, which contradicts the fact that T'(Ps,Pd) < 
min(^,^)<l. 

We thus complete the proof of the proposition. 


otherwise the detection is useless. As a result, the equivalent 
energy arrival rate is 


Ad + (1 — C}Pd{^ — 


Following similar procedures of proving Lemma m if Ad + 
(1 — ^)Pd(1 — > Pd, then 'Ld = 1- This requires 

Ad > [1 — (1 — ^)(1 — 'Ls)]Pd- Otherwise, if Ad + (1 — 
^)Pd(1 — < Pd, according to Little’s Law ll30l . we 

have 


'Ld 


Ap + (1 — C)Pd(1 — 'L5)T'd 
Pd 


(69) 


and so that 


T'd = 


_Ap_ 

[1-(1-C)(1-T's)]Pd' 


(70) 


This completes the proof. 


Appendix D 
Proof of Theorem[3] 


Denote tp = lim 


■n—¥-\-oc 


■ ELi E 


'-B%>Ps,B*>Pd 


under 


any stationary power control policy not depending on u at 
S. Due to the same reason as for (163, ^ If Ad > 

Pp, following the proof of Lemma [1] D always has enough 
energy, and thus according to Lemma |4l the optimal policy at 
S is threshold-based. And according to Theorem|2] the optimal 
threshold Pg = max (As, Pth)- 

If Ap < Pp, consider a non-EFI S with average power 
constraint As first. Any policy at S can be characterized as 
“transmit with power consumption PDF g{P) whenever P}^ > 
Pp”, as S knows P{j. For the set of policies with the same 
T'p (the probability at any slot that P{j > Pp), denote the 
corresponding power PDF as p(P)|^Q. Following the similar 
proof procedure as in Lemma [3 the optimal g{P)\^o must 
have the same structure as dhTt . Therefore the optimal policy 
must be threshold-based. Denote the transmission threshold as 
Ps, following the same procedure in the proof of Proposition 
[T] we get the same transmission and receiving probability 
'^Ps.Pd for rhe EH S case. Therefore the optimal solutions 
of the two systems are the same, which indicates that for EH 
S, the threshold-based policy is optimal. 

Einally, solving 

max (l){Ps) = exp- - - min(l, —, —) (68) 

Ps>0 \ Ptx / Ps Pd 


will get the optimal Pg in the theorem. 


Appendix E 

Proof of Proposition[3 

Similar to the proof of Proposition |2] an energy packet 
queue Q is constructed for node D. But the mount of energy 
gets back to the “source” is different. Recall that only when 
> Pp will D try to detect and receive, and otherwise the 
detection is useless. As a result, the equivalent energy arrival 
rate is 


Ap -I- (1 — ^p)Pp(l — 'I's)'kD + (1 — 'r])PFP{Ptx)'i’s'^D, 


(a) 


(b) 


where T's = is the transmission probability of 

5, and; (a) represents the case that S is not transmitting, and 
thus (1 — ^r])PF of energy is put back to the energy queue; (b) 
represents the case that S is transmitting but channel outage 
occurs (with probability p(Ptx)), and thus (1 — r])PF amount 
of energy is put back to the energy queue. Eollowing similar 
procedures of proving Lemma m if Ap + (1 - £,g)PF{l - 
T's)T'd + (1 - g)PFP{P\.x)'^D > Pf, then T'p = 1. This 
requires Ap > [1 - (1 - ^p){l - T's) - (1 - g)p{Pxx)^ s]Pf- 
Otherwise, according to Little’s Law ESI, we have 


T'p = 


Ap-I-(1 — ^r7)PF(l — T's)T'd + (1 — t7)-PFp(Ptx)T's'I'D 


and so that 
tkp = 


Pf 

Ap 


[1 - (1 - ^p){l - T-s) - (1 - g)p(Pxx)^s]PF 

This completes the proof. 


(71) 
. (72) 


Appendix E 

Proof of Proposition[3 

Similar to the proof of Lemma [T1 an energy packet queue 
Q is also constructed for node D. The difference is that, in Q 
whenever an energy packet, of volume Pp, is served by the 
“server”, some energy with amount of (1 — C)Pd gets back to 
the “source” instantaneously with probability 1 — rPs, where 
rps = min{l, is the transmission probability of S. This 
is because D only spends ^Pp when S is not transmitting. 
Recall that only when P}) > Pp will D try to detect, and 


Appendix G 
Proof of Corollary[T] 

When D knows that S is taking the disjoint threshold-based 
policy with threshold value Ps, T'p is different from (|29] | in 
Lemma 12] while rPs remains the same as (fTsT i in Lemma (Tj To 
calculate tkp, similar to the proof of Proposition |2| an energy 
packet queue Q is constructed for node D. But the mount of 
energy gets back to the “source” is Pp with probability p{Ptx), 
corresponding to the occurrence of channel outage. Because 
D knows the threshold power Pg and the CSI, if the channel 
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is in outage, it is not necessary to spend any receiving energy. 
As a result, the equivalent energy arrival rate is 

Ad + p{Ptx)PD'i'D- 

Following similar procedures of proving Lemma m if Ad + 
p{Pt!.)PD'^D > Pd, then 'Fd = 1- This requires Ad > [1 — 
F(Ttx)]Tb- Otherwise, if Ad +p(Ftx)Tb'hD < Pd, according 
to Little’s Law ESI, we have 


T'd = 

Ad + piPtx)PD'^D 

(73) 

Pd 

and so that 

Ad 

“ [1-p{P,x)]Pd' 


T'd 

(74) 

As a result, given 'I's 

= min ^1, , and 

since the events 


Bg > Ps and > Po are mutually independent, the 
corollary is proved. 


Appendix H 
Proof of Corollary[2] 

Denote the power control policy as a probability dense 
function (PDF) g{P), and denote the per-slot transmission 
successful probability with transmission power P as f{P) — 
1 — p{P), where p{P) is the channel outage probability 
defined in ([T]l. In this way, the average transmission successful 
probability is 

p+oo 

Psuc = / g(P)/(P)dP. (75) 

Jo 

Following the way of deriving 'I'd in Appendixby replac¬ 
ing 1 — p{Ptx) (for disjoint threshold-based policy) with psuc 
(for power control policy described by g{P)), we get 


T'd = min 


PsuqPd ) 


Then, the average successful probability (j) becomes 


4 > = PsucT'd = min 



(76) 


(77) 


where the first equality holds since the channel condition 
and the choice of the transmission power are independent, 
as S only has GDI. From this equation, it is easy to see 
that maximizing (j) is equivalent to maximizing psuc- As a 
result, according to Lemma [2 and 0] the optimal policy at S 
is threshold-based, and the optimal threshold Pg is the same 
as the one in Theorem |2 
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